A low bit rate speech coding method using a formant-articulatory parameter nomogram
نویسندگان
چکیده
In this paper, we propose a new method for low bit rate speech coding using a nomogram that is a pair of codebooks representing the functional relationship between formant frequencies and articulatory parameters. Significant features of our approach are 1) using the codebooks derived theoretically from the computation using a stylized vocal tract model and 2) independent coding by separating frequency information from the amplitude in a speech segment. From these features, the method is also characterized by little dependency upon speech databases and/or languages in the acoustic domain, so that it has a potential to construct a more flexible rule-based speech synthesis system. We have conducted articulatory encode-decode experiments with the bit rate range from 3.2kbps to 1.6kbps using speech samples in ASJ and TIMIT speech databases and confirmed that good quality speech synthesis is achieved with improvements on the bit allocation scheme and a frame sampling method.
منابع مشابه
Segmental feature extraction and coding for speech synthesis
This paper describes a segmental feature extraction and speech coding method in an acousticarticulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between form...
متن کاملSegmental Featurs Extraction and Coding for Speech Synthesis
This paper describes a segmental feature extraction and speech coding method in an acousticarticulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between form...
متن کاملA hybrid time-frequency domain articulatory speech synthesizer
High quality speech at low bit rates (e.g., 2400 bits/s) is one of the important objectives of current speech research. As part of long range activity on this problem, we have developed an efficient computer program that will serve as a tool for investigating whether articulatory speech synthesis may achieve this low bit rate. At a sampling frequency of 8 kHz, the most comprehensive version of ...
متن کاملEstimation of articulatory parameter trajectory from speech acoustic dynamics
This research aims to perform articulatory analysis as a basis for low bit-rate speech coding. The classical approach consists of gathering a large set of acoustic and articulatory vector pairs in a codebook. Then, based on some criteria, the non-uniqueness of the articulatory trajectories is solved using a dynamic optimization procedure. An articulatory codebook requires a model capable of gen...
متن کاملArticulatory analysis using a codebook for articulatory based low bit-rate speech coding
Fundamental to the success of the articulatory based speech coding is the mapping from acoustics to articulatory description. As the mapping is not unique and based on articulatory continuity criteria, the non-uniqueness of the articulatory trajectories is solved using a forward dynamic network. In this paper, we present new results on forward dynamic network used to estimate articulatory traje...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000